Classification Using Mass Spectrometry Proteomic Data with Kernel-Based Algorithms

نویسندگان

  • Zhenqiu Liu
  • Shili Lin
چکیده

Motivation: Early detection of cancer is crucial for successful treatment, and protein profiling using mass spectrometry (MS) data has been investigated as a potential tool. However, due to the high correlation and huge dimensionality of MS data, it is crucial to modify existing algorithms and to develop new ones, where necessary, for analyzing such data. Results: We develop a group of logistic regression kernel coupling algorithms for classification of normal versus cancer samples. Furthermore, we propose a systematic three-step protocol for analyzing MS data, from removal of baseline noise and normalization, to feature extraction and reduction, and finally to classification. The systematic analysis paradigm and the proposed algorithms were applied to an ovarian and a prostate cancer dataset. The results show that our proposed approaches can be an effective tool for analyzing MS proteomics data.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Comparison of Computational Algorithms for the Classification of Liver Cancer using SELDI Mass Spectrometry: A Case Study

INTRODUCTION As an alternative to DNA microarrays, mass spectrometry based analysis of proteomic patterns has shown great potential in cancer diagnosis. The ultimate application of this technique in clinical settings relies on the advancement of the technology itself and the maturity of the computational tools used to analyze the data. A number of computational algorithms constructed on differe...

متن کامل

Diagnosis of Diabetes Using an Intelligent Approach Based on Bi-Level Dimensionality Reduction and Classification Algorithms

Objective: Diabetes is one of the most common metabolic diseases. Earlier diagnosis of diabetes and treatment of hyperglycemia and related metabolic abnormalities is of vital importance. Diagnosis of diabetes via proper interpretation of the diabetes data is an important classification problem. Classification systems help the clinicians to predict the risk factors that cause the diabetes or pre...

متن کامل

Nonparametric Models for Proteomic Peak Identification and Quantification

We present model-based inference for proteomic peak identification and quantification from mass spectroscopy data, focusing on nonparametric Bayesian models. Using experimental data generated from MALDI-TOF mass spectroscopy (Matrix Assisted Laser Desorption Ionization Time of Flight) we model observed intensities in spectra with a hierarchical nonparametric model for expected intensity as a fu...

متن کامل

Proteomic Analysis of Gene Expression in Basal Cell Carcinoma

Background: Basal Cell Carcinoma (BCC) is a type of non-melanoma skin cancer. Alteration in gene expression is the important event that happens in cancer cell. Detection of this event is possible by proteomics techniques. Methods: Normal and tumor tissues were taken from BCC patient. Total proteins were purified by standard methods, and proteins were separated by two-dimensional electrophoresis...

متن کامل

Impact of Patients’ Gender on Parkinson’s disease using Classification Algorithms

In this paper the accuracy of two machine learning algorithms including SVM and Bayesian Network are investigated as two important algorithms in diagnosis of Parkinson’s disease. We use Parkinson's disease data in the University of California, Irvine (UCI). In order to optimize the SVM algorithm, different kernel functions and C parameters have been used and our results show that SVM with C par...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006